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FOREWORD 


The  Operations  Research  Center  at  the  Massachusetts 
Institute  of  Technology  is  an  interdepartmental  activity 
devoted  to  graduate  education  and  research  in  the  field  of 
operations  research.  The  work  of  the  Center  is  supported, 
in  part,  by  government  contracts  and  industrial  grants-in- 
aid.  Expenditures  associated  with  the  work  reported 
herein  were  supported  by  the  U.  S.  Army  Research  Office 
(Durham)  under  Contract  No.  DA-31-124-ARO-D-209. 


John  D.  C.  Little 
Director 


Browsing  mav  be  defined  as  a  search,  hopefully  seren¬ 
dipitous.  In  connection  with  a  library,  one  may  browse 
through  the  display  of  recent  books  to  see  what  is  new,  or 
through  a  portion  of  the  library  shelves  in  the  hope  of  finding 
a  text  which  might  contribute  the  fact  or  idea  needed  in  some 
intellectual  effort.  One  might  scan  quickly  through  the  fiction 
collection  to  see  whether  some  title  might  strike  one's  fancy 
or,  more  rarely,  might  thumb  through  the  card  catalogue  to  see  * 
whether  some  known  author  has  written  a  book  one  has  not  yet 
read.  In  each  case  the  browser  is  not  certain  he  will  find 
anything  of  use  to  him  but  he  has  hopes,  and  past  experience 
supports  that  hope.  Browsing  is  prevalent  in  most  libraries. 

In  fact  it  can  be  argued  that  browsing  is  one  of  the  most  frequent 
ways  in  which  the  library  user  finds  the  books  he  borrows.  To 
analyze  browsing  probabilistically,  to  see  whether  browser  or 
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librarian  can  improve  its  efficiency,  one  might  try  applying 
the  theory  of  search. 

Search  Theory 

Search  theory  was  developed  in  Aorld  War  II  in  connection 

with  antisubmarine  ■warfare*’.  Probability  theory  and  geometry 

suggested,  and  experimental  observation  verified,  that  there 

2 

was  a  fairly  simple  relationship  , between  the  chance  of  success 

\ 

in  spotting  a  submarine  in  a  given  area  of  the  ocean,  and  the 


degree  of  effort  spent  by  a  patrol  aircraft,  for  example,  in 
searching  the  area.  If  the  submarine  is  somewhere  in  area  A 
then  the  probability  of  success  P_  in  spotting  the  submarine  is 

D 

,  CD 

where  4y  the  search  coverage,  equals  pT/A,  the  search  rate 
of  the  plane  in  square  miles  per  hour,  multiplied  by  T  the 
hours  spent  in  the  area  and  divided  by  the  number  of  square 
miles  in  area  A  (e«2.7lt  is^tbAseof  rutfoM  towns).  The  search 
rate  p  depends  on  the  altitude  of  flight  of  the  plane,  its 
speed  and  on  the  search  method  (radar  or  visual)  and  equipment; 
it  has  to  be  measured  for  each  plane  and  equipment.  ' 

Figure  1  is  a  plot  of  P_  versus  4»  Note  that  even 
though  the  area  is  "covered"  (i.e.,  4  «  1)  it  still  is  not  certain 
(i-e.,  P_  is  not  unity)  that  the  submarine  is  spotted,  even  though 
it  is  there  and  on  the  surface.  Errors  in  navigation  will  leave 
some  areas  uncovered  while  other  areas  are  "oversearched" ; 
operators  and  equipment  are  fallible.  Poor  planning  and  main¬ 
tenance  often  lowered  the  chance  of  success  below  that  given 
in  Bq.  (1);  very  seldom  was  it  bettered.  Note  also  that,  in 
general,  the  coverage  is  proportional  to  the  time  spent.  It 
usually  turned  out  that  using  a  faster  plane,  to  search  the 
same  area  in  shorter  time,  simply  increased  the  number  of  times 
the  target  was  overlooked.  Particularly  in  the  case  of  visual 
search,  experiments  made  during  the  war  by  Selig  Hecht  (unpublished) 
showed  that  "haste  makes  waste".  Coverage  4  in  general  was 
proportional  to  the  time  spent  per  unit  area  of  scan;  it  didn't 
matter  much  whether  this  time  was  spent  by  covering  some  subarea 
thoroughly  or  else  by  scanning  cursorily  over  the  whole  area. 


Pig.l  Chance  of  spotting  one  book;  of 
resent  interest,  placed  at  random  in 
books,  as  function  of  search  parameter  pI/N 
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'  If  there  are  two  areas  of  the  ocean,  A-^  and  if  the 
probability  that  the  submarine  is  in  A^  is  and  the  probab¬ 
ility  that  it  is  in  A2  is  P2*then  the  chance  the  submarine  is 
spotted  is 

r8  -  p^i  -  •"*!)  +  p2(i  -  »~*2)  (2) 

The  search  coverage  4-^  of  area  A1  is  4^  *  pTj/Ap  with  T^  the 

time  spent  in  A^;  similarly  for  4  2  a  pT2/A2»  witk  T2  tlle  time 

2 

spent  in  A2»  Formulas  were  developed  determining  optimal 
allocation  of  search  time  between  the  two  areas,  in  order  to 
maximize  P  . 

Search  and  Browsing 

Let  us  now  apply  search  theory  to  the  "operation"  of 
browsing,  of  scanning  the  books  on  a  set  of  shelves  in  the 
library.  Suppose  the  shelves  contain  N  books.  The  chance  that 
the  browser  will  spot  a  particular  book,  placed  at  random  among 
the  N  books,  is 

P8  •  1  -  e"*  ;  *  -  pT/N  (3) 

where  T  is  the  time  spent  and  p  is  a  constant  that  might  be 
called  the  search  rate  of  the  browser.  Its  value  varies  from 
person  to  person  and  also  depends  on  the  accessibility  and 
illumination  of  the  shelves.  Its  value  for  a  particular  browser 
and  set  of  shelves  can  be  determined  by  running  a  series  of 
trials  (20  or  more),  eacn  run  for  the  same  time  T,  each  with  a 
different  target  book,  placed  at  random  in  the  collection,  to 
see  what  fraction  P8  of  the  trials  end  in  finding  the  book 
within  time  T  (N  should  be  at  least  1000  and  T  should  be  chosen 
so  P_  is  between  1/3  and  2/3  for  best  accuracy).  Knowing  N 
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and  T  and  estimating  f  from  the  trials,  p  can  be  obtained. 

3 

For  the  purposes  of  this  analysis  an  estimate  within  a  factor 
of  two  is  sufficient.  Measurements  made  by  the  writer  indicate 
that,  for  him,  under  good  lighting  conditions,  p  is  somewhere 
between  100  and  200  volumes  per  minute. 

To  apply  this  formula  to  browsing  we  have  to  reach  some 

conclusions  regarding  the  book  (or  books)  searchel  for.  In 

most  cases  the  browser  does  not  know  himself  wnirh  book  he  wi  11 

nick  out,  nor  indeed  whether  he  will  find  any  book  he  wants 

just  then,  even  if  he  spends  all  day  at  it.  Nevertheless  each 

regular  user  of  a  library  has  some  inkling  of  which  portion  of 

the  library  is  more  likely  to  yield  books  of  immediate  interest 

to  him.  If  put  to  it,  by  using  methods  developed  by  decision 
x 

theorists  ,  he  could  estimate  a  priori  an  expected  number  IS  of 
books,  of  interest  tc  him  at  the  moment,  that  might  be  present 
in  a  specific  section  of  N  books,  though  he  does  not  know  where 
in  the  section  the  books  might  be(nor,  ahead  of  time,  just  what 
book  it  might  be).  For  the  purpose  of  this  paper  it  is  suf¬ 
ficient  if  he  can  estimate  £  to  within  a  factor  of  2  or  J. 
Habitual  browsers  in  a  library  do  this  intuitively  when  selecting 
which  section  of  the  library  they  will  browse  in  luring  a  par¬ 
ticular  stay.  They  go  to  that  section  of  the  library  which 
they  estimate  has  the  greatest  likelihood  of  naving  a  book  they 
might  want  just  then  to  read.  Of  course  immediate  interests 
change;  a  particular  browser  may  have  a  completely  different 
set  of  values  of  the  a' a  next  time  he  visits  the  library. 

Thus  our  theory  indicates  that  the  browser,  during  s 
given  visit,  may  divide  his  search  among  M  different  sections 


of  the  library  (  perhaps  distinguished  by  general  subject  matter 
or  location),  spending  time  T1  in  the  first  section,  which 
contains  books,  and  so  on  for  the  U  sections.  If  he  does 
this,  the  expected  number  of  books  he  will  find  of  immediate 
interest  to  him  is  the  sum 

S  -  S,  +  S2  +  •  •  •  +  where 

_u  (*0 

Sm  .  E  (1  -  e  pT/N 

m  m  m  m  m 

As  mentioned  earlier,  Em  is  his  a  priori  estimate  of 
the  number  of  books  of  immediate  interest  to  him  which  might  be 
in  section  m  and  p  is  his  search  rate  (p  may  vary  from  section 
to  section,  but  this  complication  is  not  usually  worth  adding). 

Of  course,  in  any  particular  browse,  he  may  not  find  any  books 
of  interest  in  section  m,  or  he  may  find  33^;  search  theory 
indicates  that  Sm  is  his  best  a  priori  estimate  of  what  would 
be  the  result  of  his  spending  time  Tm  in  the  m’th  section.  We 
might  emphasise  the  probabilistic  nature  of  i>m  by  calling  it 
the  expected  success  of  his  proposed  expenditure  of  time  in 
scanning  the  m’th  section.  The  total  time  spent  browsing,  during 
that  particular  visit,  is  of  course  the  sum  T  •  T^  ♦  T->  ♦  •  •  •  ♦  T^. 

The  Browsers  Problem. 

On  the  basis  of  his  estimates  of  the  browser  has 
the  problem  of  distributing  the  total  time  T  he  wishes  to  spend, 
in  such  s  manner  as  to  make  the  total  expected  success  S  as 
large  as  possible.  That  this  is  s  meaningful  problem  is  due 
to  the  fact  that  aearch  is  subject  to  the  law  of  diminishing 
returns.  Figure  1  shows  that  doubj ing  the  time  spent  scanning 
a  given  section  does  not  double  the  expected  success.  In  fart 


if  enough  time  has  already  been  spent  so  that  4  =  pT/N  is 
larger  than  2,  increasing  4  to  4  by  spending  another  equal 
amount  of  time  in  the  same  section  can  only  increase  3  by 
about  another  ten  percent;  certainly  it  would  be  better  to 
spend  this  additional  time  in  scanning  another  section,  of 
equal  promise,  as  yet  unscanned. 

This  can  be  made  precise  by  asking  what  division  of 
tOoal  time  T  should  be  made  between  two  sections  of  equal 
numoer  of  volumes  N  each  and  with  equal  estimated  numbers  a 
of  books  of  immediate  interest  to  the  browser.  The  expected 
value  of  3,  if  the  visitor  spends  time  t  in  one  section  and 
time  T  -  t  in  the  other,  is 

3  .  SU  -  e-pc/li)  ♦  S(1  - 

which  is  plotted  in  Fig.  2  for  pT/N  *  2.  It  is  obvious  that 
the  maximum  is  reached  when  time  T  is  divided  equally  between 
the  two  sections  of  equal  promise  E,  though  the  flatness  of 
the  maximum  inlicates  that  it  is  not  very  important  that  the 
equality  be  precise.  The  symmetry  of  the  figure  indicates  that 
as  long  as  N  ..1  K  are  ejual  in  tne  two  sections,  the  time 
should  be  equally  divided,  no  matter  how  large  or  small  is  T, 
the  total  time  to  be  spent  browsing.  Tne  statement  can  be 
extended:  If  there  are  M  sections,  all  equal  in  regard  to  E 
and  to  N,  then  the  total  browsing  time  T  should  be  divided 
equally  among  all  Mi  sections,  spending  time  f/V  on  ench,  to 
achieve  maximum  expected  success.  Indeed,  if  one  has  no  idem 
what  might  be  useful,  so  that  the  density  of  expected  books  of 
interest,  E/R,  is,  a  priori,  the  s?ae  for  every  section  in  the 


'  M 
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library,  then  it  is  more  productive  to  scan, rapidly  and  more 
or  less  uniformly,  all  the  sections  rather  than  concentrate 
on  one  portion^. 

We  note  tnat  it  is  the  estimated  density  of  books  of 
possible  immediate  interest,  V  *  iS/N,  which  is  the  criterion. 

If  this  density  is  uniform  in  a  section  or  sections  of  the 
library,  if  the  e;  pected  book  or  books  of  possible  immediate 
interest  are  equally  likely  to  be  anywhere  in  the  section  or 
sections,  then  the  browser  should  spend  roughly  equal  time 
scanning  each  portion  of  them,  even  if  this  means  only  a  quick 
scan  a  Ion*  each  shelf.  Probabilistically  we  can  say  that  each 
took  in  the  section  or  sections  has  an  equal  a  priori  interest 
potential  V  =  S/N  lor  this  browser  for  this  visit,  and  thus 
deserves  an  equal  portion  of  the  scan  (until,  of  course,  a 
satisfactory  number  of  books  of  interest  have  actually  been  found). 

A  more  difficult  problem  arises  when  the  estimated 

density,  or  interest  potential  V  varies  from  section  to  section 

of  the  library;  what  then  should  be  the  allocation  of  time 

spent  in  browsing?  The  derivation  of  the  formula  is  given  in 
2 

the  reference  ;  here  we  need  only  translate  the  result  into 
terms  appropriate  for  book  browsing.  It  will  be  more  under¬ 
standable  if  we  start  by  applying  it  to  a  specific  case. 

Suppose  there  are  four  sections  which  have  promise  for 
the  potential  browser  this  visit.  The  first  section  has 
=  1000  volumes  and  the  a,  priori  estimate  is  that  it  might 
contain  *  3  books  of  immediate  interest.  Since  he  has  no 
idea  where  these  3  books  might  be  in  the  section,  the  prospective 
browser  must  assume  (until  he  finds  otherwise)  that  each  book 
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in  this  section  has  an  interest  potential  *  5/1000  *  0.003. 
The  values  of  N,  E  and  V  for  each  of  the  four  sections  are 


TABLE  I 

Sect. 

N  S 

V 

InV 

InV  +7*3 

InV  +  3.22 

1 

1000  3 

0.003 

-5.8 

1.5 

2.42 

2 

5000  2 

0.0004 

-7.8 

- 

0.42 

3 

5000  1 

0.0002 

-8.5 

- 

«< 

4 

9000  1 

0.00011 

-9.1 

- 

- 

ln(0. 00069)  -  - 

■7.3  ; 

ln(0. 

00027)  *  -8.2 

given  in  Table  I. 

We  first 

look  up 

the  natural 

logarithms 

the  interest  potentials  V  of  the  books  in  each  section;  this  is 
given  in  the  fifth  column  of  Table  I.  We  will  assume,  for  the 
purpose  of  the  example,  that  the  browser's  search  rate  p  is 
150  books  per  minute. 

The  situation  is  more  understandable  as  shown  in  Fig. 3* 
where  we  have  plotted  four  rectangles,  of  width  proportional  to 
the  respective  number  of  books  in  each  section,  and  of  height 
equal  to  the  corresponding  InV.  Now  suppose  the  prospective 
browser  has  only  T «  10  minutes  to  spend  browsing;  how  should 
he  divide  his  time  among  the  sections?  We  obtain  the  solution 
by  drawing  a  horizontal  line,  at  the  level  marked  lnA10,  such 
that  the  area  between  this  line  and  the  top,  heavy  line  of  the 
plot  is  just  pT  ■  150*10  »  1500.  This  area  is  reached  for 
lnX^Q  »  -7,3,  when  the  mrea,  cross-hatched  in  the  figure, 

(lnV^  -  lnX^N^  »  (-5.8  +  ?.3)*1000  is  just  equal  to  1500.  In 
this  case  only  section  1  is  involved;  the  prospective  browser 
should  spend  all  his  10  minutes  in  section  1;  he  maximizes  his 


•g.J.  Graphical  illustration  of  optimal 
allocation  of  search  effort. 
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expected  success  by  ignor:.ng  the  sections  with  lower  interest 
potential  V.  The  expected  success, 

S  »  "  3  -  0.00069x1000  -  2.3  books 

is  greater  than  could  be  attained  by  devoting  any  part  of  the  . 
ten  minutes  to  any  of  the  other  sections. 

Next  suppose  the  browser  has  30  minutes  at  his  disposal. 

In  that  case  we  place  lnX^Q  so  that  the  enlarged  area  (that 
shaded  plus  that  cross-hatched)  just  equals  pT  «  150x30  *  4500. 

This  happens  to  come  at  lnXjQ  *  which  has  some  area  in 

section  1  and  some  in  section  2,  but  none  in  sections  3  or  4. 

The  relative  times  are  to  be  divided  in  proportion  to  the  areas 
(VeXloV.  -  lnx50) 

time  in  1  «  ^^(-5.8  +  8.22)  *  16  minutes,  evenly  spread  over  1 

time  in  2  *  ^^(-7*8  +  8.22)  *  14  minutes,  evenly  spread  over  2 

Thus  he  should  spend  less  time  in  section  2,  even  though  there 
are  5  times  as  many  books  there,  and  he  should  still  ignore 
sections  3  and  4.  Only  when  he  has  more  than  about  41  minutes 
should  he  start  glancing  at  section  3  and  only  if  he  has  more 
than  85  minutes  to  browse  should  he  bother  with  section  4.  With 
the  30  minute  limit  and  the  16  -  14  division,  his  expected  success 

18  S  -  *1  -  +  E2  -  Xj0N2  -  3-0.27  +  2-1.35  -  5.4  books 

which  is  the  maximum  he  can  expect  to  find  (though  he  may  be 
lucky  some  times,  of  course). 

To  generalize  the  procedure,  we  plot  the  situation  as 
in  Fig*  3t  with  the  m'th  section  represented  by  a  column  of 
height  lnVm  ■  lnC^/N^)  and  with  width  equal  to  Nffi.  We  then 
find  the  horizontal  line,  at  level  lnX^,  for  which  the  area 


between  it  and  the  top  line  of  the  plot  is  just  equal  to  pT, 
with  T  equal  to  the  time  available  for  browsing.  The  time  Tm 
to  be  spent  in  section  m,  and  the  total  expected  success  S  are 
then  given  by  the  equations 

T  r<Vp)(lnVB  -  lnXT>  (it  W 

m’l  0  UfW  (5) 

S  «  +  &2  *  ^^^2  +  •  •  • 

where  the  sum  for  3  includes  only  those  sections  for  which 
V  =  E  /N  is  greater  than  \,n. 

Of  course  it  would  be  foolish  for  the  expectant  browse? 
to  go  through  such  an  analysis  in  detail  before  he  starts 
browsing  (he  would  use  up  all  his  time  just  figuring  out  what 
to  doi).  However  the  essential  point  of  the  exercise  i9  that 
wherever  the  interest  potential  V  =  E/N  is  uniformly  spread 
then  the  search  should  be  uniformly  spread;  and  wherever  the 
interest  potential  is  higher  than  in  other  sections  there  the 
search  effort  should  be  strongly  concentrated,  even  to  the 
extent  of  ignoring  entirely  sections  of  lower  potential.  Thus 
search  allocation  should  be  a  non-linear  function  of  interest 
potential.  Of  course  if  it  is  estimated  that  one  portion  of 
a  section  has  higher  interest  potential  than  the  rest,  then 
this  portion  should  be  considered  a  separate  section  (for  this 
browser)  to  be  searched  much  more  carefully  than  the  rest. 

The  primary  purpose  of  this  analysis  has  been  to  make 
us  familiar  with  the  methods  and  concept?  of  search  theory,  as 
applied  to  libraries;  now  we  can  go  on  to  discuss  the  more 
important  problem,  of  what  the  librarian  can  do  to  improve 
matters  for  all  browsers. 
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The  Librarian* s  Problem 

The  problem  is  relatively  simple  for  each  individual 
library  user.  Though  his  desires  may  change  from  visit  to 
visit,  he  needs  only  to  estimate  the  interest  potential  of 
books  in  various  parts  of  the  library,  in  accord  with  his 
immediate  interests,  and  then  to  allocate  his  search  efforts 
as  has  been  outlined,  concentrating  strongly  on  the  highest 
potential  areas.  It  is  quite  otherwise  for  the  librarian, 
for  the  interests  of  different  browsers  differ  widely;  indeed 
the  interests  of  the  same  browser  vary  widely  from  visit  to 
visit.  Is  there  anything  the  librarian  can  do  to  improve  the 
success  of  all  browsers,  or  at  least  to  improve  the  success  of 
the  average  browser? 

One  thing  is  apparent  immediately;  the  librarian  should 
arrange  his  collections  so  as  to  be  obviously  differentiable, 
in  interest  potential,  to  the  majority  of  library  users.  The 
worst  imaginable  library,  for  a  browser,  would  be  one  in  which 
he  could  not  differentiate  at  all  between  the  interest  potentials 
of  different  sections,  where  he  would  have  to  treat  all  shelves 
as  being  equally  likely  (or,  rather,  equally  unlikely)  to 
produce  what  he  might  want.  That  library  which  makes  it  possible 
for  the  average  browser  to  pick  quickly  a  relatively  few,  rela¬ 
tively  small  sections  of  high  interest  potential  for  his  present 
desires,  so  he  can  ignore  the  rest,  is  the  library  which  is  most 
efficient  for  the  browser  to  use.  The  subdivision  is  not  easy; 
too  fine  a  division  makes  it  necessary  for  the  browser  to  search 
too  many  sections  in  order  to  cover  his  range  of  interest;  he 
should  not  have  to  cover  more  than  about  three  sections  per  trip. 
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H  Parenthetically,  this  is  the  reason  why  card  catalogues  are 
absurdly  ineffective  for  browsing.  Aside  from  the  very  small 
search  rate,  every  drawer  is  more  or  less  equally  sparse  in 
interest  potential;  very  few  interest  spans  go  according  to  the 
alphabet,  even  in  the  subject  catalogue.  It  is  important  for 
the  designers  of  computerized  catalogues  to  realize  that  such  a 
catalogue  also  will  be  spurned  by  the  browser  (and  thu3  will 
have  its  usefulness  seriously  impaired)  if  it  does  not  provide 
for  quick  and  simple  means  of  assembling  sub-catalogues  of  high 
interest  potential,  no  matter  what  the  interest  span  of  the 
potential  browser  may  be.  If  the  computer  can  assemble,  in  a 
minute  or  so,  a  sub-catalogue  of  a  few  thousand  items,  all  of 
high  interest  potential,  of  combinations  of  such  disparate 
subjects  as  entire  functions,  decision  theory,  ideas  of  prob¬ 
ability  in  Hellenic  mathematics  and/or  data  on  book-use  in 

to 

college  libraries,  for  sxample,  with  the  browser  then  able^flick 
through  the  collection  in  five  or  ten  minutes;  only  then  will 
the  computerized  catalogue  begin  to  replace  the  simple  roaming 
through  the  stacks,  which  has  always  been  (and  may  always  be) 

i  , 

the  usual  way  of  finding  what  book  one  wants. 

But  to  return  to  the  librarian's  present  problem.  He 
will  (and  does)  help  the  browser  immensely  by  arranging  the 
books  on  the  shelves,  not  alphabetically,  or  at  random,  but  by 
"subject  class",  so  if  the  user  knows  his  Dewey  or  LC  code  he 
can  quickly  pick  out  the  high-interest-potential  regions  for 
his  present  predelict ions.  The  trouble  comes  when  the  collection 
becomes  too  large  for  all  of  it  to  be  easily  available  to  all, 
when  even  one  class  becomes  so  large  it  cannot  be  scanned 

i 

efficiently  in  a  fraction  of  an  hour. 
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By  this  time,  of  course,  even  in  one  subject  class,  the 
high-interest-potential  items  have  been  diluted  with  a  lot  of 
old  and/or  highly  specialized  books,  of  interest  to  very  few 
library  users,  which  lower  the  interest  potential  of  each  section, 
for  nearly  all  users.  This  may  be  of  little  moment  in  some 
research  libraries,  where  browsing  is  seldom  practiced,  but  in 
most  libraries  this  would  mean  that  browsing  is  no  longer 
efficient  and  hence  is  frustrating.  By  this  time  also,  it 
usually  happens  that  the  collection  has  got  so  large  that  it 
cannot  all  be  kept  in  one  place.  The  question  therefore  is, 
how  to  subdivide  the  subject  classes  so  that  one  portion  regains 
its  original  browsing  effectiveness,  without  at  the  same  time 
destroying  other  utilities. 

One  possibility  is  to  subdivide  by  subject  matter,  to 
establish  instead  of  a  science  library,  for  example,  a  physics 
library,  a  mathematics  library,  etc.  But  this  solution  further 
reduces  the  browser's  chance  of  success.  For  if  the  subject 
groupings  are  left  the  same  size  but  simply  moved  to  separate 
locations,  each  section  will  still  be  diluted  with  low-interest- 
potential  books  and  if  the  browser's  immediate  interests  involve 
both  mathematics  and  physics,  for  example,  he  will  simply  have 
to  walk  further  to  scan  sections  which  are  still  of  low  interest 
potential.  Somehow  the  reconstruction  of  the  library  should 
lead  back  to  more  sections  of  high  interest  potential  for  the 
average  user. 

The  solution  is  not  simple,  but  it  must  involve  a  certain 
amount  of  concentration  of  high-interest-potential  books  in 
some  subject  sections.  Sections  which  have  become  too  large 
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to  browse  through  efficiently  in  the  time  the  average  browser 
can  spend,  should  be  separated,  not  into  subdivisions  according 
to  subject,  but  into  a  high-interest  section  and  a  low-interest 
section.  In  other  words,  some  fraction  of  the  books  in  this 
overlarge  section  should  be  "retired"  to  a  less  accessible 
region  of  the  library.  This  may  perhaps  be  a  disadvantage  to 
a  few  users,  who  may  be  interested  in  the  older  or  more 
specialized  books  (though  it  may  also  be  advantageous  to  him), 
but  it  will  definitely  be  of  advantage  to  the  majority  of  the 
users,  who  can  again  browse  efficiently. 

A  word  needs  to  be  said  here  about  the  size  and  coverage 
of  the  subject  sections  we  speak  of  here.  A  few  specialist 
users  will  want  to  scan  only  tnose  shelves  covering  the  history 
of  the  reign  of  Philip  Augustus,  for  example,  but  the  majority, 
if  they  go  to  the  history  shelves  at  all,  would  tend  to  scan 
all  books  on  French  history,  or  even  all  European  history. 

That  subject  section  which  the  average  user,  in  one  of  his 
visits,  rates  as  having  uniform  interest  potential,  is  what  we 
shall  call  a  uniform  subject  section.  To  the  average  browser 
the  book  he  might  want  may  be  anywhere  in  such  a  section  and 
he  will  tend  (if  the  section  has  not  grown  too  large)  to  scan 
it  uniformly  if  he  scans  it  at  all.  Data  on  actual  usage  (and 
correlation  of  usage)  might  be  collected  to  decide  how  wide  a 
subject  range  should  be  included  in  a  uniform  subject  section, 
for  a  particular  library.  But  most  librarians,  as  well  as  many 
habitual  users  of  a  library,  can  make  estimates  of  appropriate 
subject  range  which  would  be  the  right  order  of  magnitude. 
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Until  further  measurements  are  made,  we  might  assume  that  broad 
subject  categories  (such  as  physics  or  economics  or  ancient 
history)  would  correspond  to  uniform  subject  sections. 

Returning  again  to  the  main  problem,  we  reiterate  that 
whenever  a  uniform  subject  section  becomes  too  large  for  the 
average  browser  to  cover  effectively  in  a  quarter  to  a  half 
hour  (larger  than  about  1000  to  2000  volumes)  it  should  be 
split  into  a  low-use  section  of  "retired"  books  and  a  high-use 
section  for  browsing.  It  is  not  difficult  to  measure  the  degree 
of  use  of  any  individual  book;  if  circulation  is  allowed, a 
book's  circulation  rate  is  a  fairly  good  measure  of  its  "popu¬ 
larity".  Thus  it  is  ressonaLle  to  consider  tnat  the  average 
interest  potential,  for  the  average  browser,  for  a  given 
unifora  subject  section,  is  proportional  to  the  mean  circulation 
rate  of  the  books  in  the  section.  If  the  librarian  can  make 
hia  split  so  as  to  have  the  mean  circulation  rate  of  the 
browsing  portion  considerably  higher  than  that  for  the  less 
accessible  part,  he  will  have  made  the  task  of  the  average 
browser  much  more  awarding.  As  mentioned  before,  optimal 
allocation  of  search  effort  is  highly  non-linear;  a  split  which 
raises  the  interest  potential  by  as  little  as  50  percent  may 
make  it  worthwhile  for  many  more  browsers  to  scan  it,  though 
they  would  (and  should)  have  ignored  the  previous,  unseparated 
section. 

It  is  thus  assumed  that  the  mean  circulation  U  of  the 
books  in  a  section  is  proportional  to  the  mean  value  of  the 
interest  potential  of  the  section  for  the  users  who  scan  it 

•*  «u»  7  .  CB  or  B  .  C(HB) 


(6) 
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Here  E  is  the  mean  value  of  the  a  priori  estimate  of  books  in 
the  section  that  are  likely  to  be  of  immediate  interest  tc  the 
browser,  averaged  over  those  who  browse  in  the  section; 

V  «  E/N  is  the  mean  interest  potential,  averaged  over  the  same 
users;  and  N$  is  the  total  yearly  circulation  of  the  section. 
The  exact  value  of  constant  C  is  not  important  for  our  present 
uses;  we  can  conveniently  take  it  to  be  about  0.001.  tie  also 
assume  that  the  chance  of  a  particular  book  being  the  one  a 
browser  picks  out  is  similarly  proportional 

to  the  particular  book's  yearly  circulation,  R; 

v  «  Crt  •,  C  *  0.001  (?) 

where  v  might  be  called  the  book  potential  of  the  individual 
book.  Thus  7  is  the  average  of  the  individual  book  potentials 
of  all  the  books  in  the  section. 

There  has  been  some  study  of  the  distribution  of  be  ks 

according  to  their  circulation'.  For  the  purposes  of  this 
paper  we  need  not  go  into  detail,  since  changes  of  factors  of 
1.5  or  2  are  the  only  ones  worth  considering  here.  To  this 
approximation  we  can  assume  that  the  number  of  books,  in  a 
uniform  subject  section,  which  have  book  potential  greeter  than 
v,  is  Ne“v^»  where  M  is  the  number  of  books  in  the  section  and 
7  is  given  by  nq.(6).  Thus  the  estimated  number  of  books  in 
the  section  with  book  potential  between  v  and  v  ♦  dv  is 

Nf(v)dv  •  2  e~v^dv  ;  |f(v)dv  3  i|e”v>^dv  •  1  (9) 

where  f(v)  is  the  probability  density  that  a  book  has  book 
potential  in  dv  at  v.  The  mean  interest  potential  of  tm 
section  is  thus  _ 

jvf(v)dv  .  e“v^dv  .  7  •  C5 

»»  e 


(9) 
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An  Optimal  Retirement  Flan 

Thus  the  most  effective  way  to  separate  off  a  high- 
interest~potentisl,  browsing  section  from  an  overlarge  subject 
section  would  be  according  to  circulation  rate.  The  "retired* 
section  would  be  the  fraction  (1-x)  of  books  having  bock 
potent jnl  ranging  from  0  to  vQ  and  the  " reconcent rated”  sectiGn 
would  be  the  remaining  xN  books  (where  R  is  the  siae  of  the 
undivided  section},  each  one  having  book  potential  greater  than 
v  .  Of  course  v  is  related  to  x  by  the  requirement  that 

^f(v)  dv  *  e~v°^  ■' *  x  or  vQ  *  7  ln(l/x)  _  (10) 

V  .  ■ 

The  mean  interest  potential  of  the  "reconcentrated"  section  is 

then  i  f  -i 

7r  *  ~jvf(v)dv  *7  +  vQ  *  7^1  -f  ln(l/x)j  (11) 

indice ting  an  enhancement  of  the  mean  interest  potential  by  the 
factor  in  square  brackets.  The  mean  interest  potential  of  the 
"retired"  section  is  r  „  n  V 

^  -  A  *<i>J 

displaying  a  corresponding  reduction  (not  very  much  if  x  is  small). 

The  mean  interest  potential  of  the  "reconcentrated" 
section  is  increased,  but  at  the  cost  of  reducing  the  total 
number  of  books  to  be  scanned  and  inevitably  of  reducing  the 
estimated  number  of  books  of  immediate  interest  to  the  average 
browser,  for  1  equals  7  times  the  number  of  books  in  the  section. 
\  *  xN7p  -  3x[l  +  ln(  1/x )]  (12) 

where  E  »  N7  ■  C(NS)  is  the  estimated  number  of  books  of  interest 
to  the  average  browser  in  the  original,  undivided  section.  The 
quantity  x[l  +  ln(l/x|  is  less  than  unity  for  all  x  between  0 
and  1.  Even  if  we  retire  only  books  with  the  lowest  circulation 
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rate,  we  will  always  retire  some  books  which,  once  in  a  while, 
would  be  of  interest  to  some  browser.  The  reduction  is  not 
very  great  if  we  retire  only  a  few  books  (i.e.,  if  x  is  nearly 
unity)  but  then  we  would  not  have  increased  the  mean  interest 
potential  by  very  much.  The  reduction  becomes  quite  apparent 
if  x  is  quite  small;  we  would  have  increased  the  mean  interest 
potential  of  the  remaining  books  at  the  acpense  of  depriving  the 
browser  of  the  chance  to  see  a  number  of  books  he  might 
occasionally  be  interested  in. 

To  find  the  optimal  middle  ground  we  have  recourse 
again  to  the  search  formulas  (3)  and  (4),  which  hold  for  each 
browser.  If  the  reconcentrated  section  is  still  so  large  that 
the  average  browser  cannot  efficiently  scan  the  whole  section 
in  the  time  he  can  spend,  then  the  fact  that  Sr  is  larger  will 
not  help,  for  he  hasn't  the  time  to  find  the  books  of  interest 
among  all  the  others.  If  x  is  made  too  small  the  average  browser 
will  "oversearch"  the  small  collection,  but  will  miss  some  of 
the  books  which  have  been  retired,  oomewhere  between  is  an 
optimum  size  that  will  maximize  the  expected  success  for  the 
average  browser. 

If  the  browser  spends  time  t  in  the  reconcentrated 
section  his  expected  success  is 

Sr  *  Sr(l  -  e~pt/xN)  *  lx[l  +  ln(i)](l  -  e”pt/xN)  (13) 

where  p  is  the  average  of  the  browser’s  search  rate  and  xN  is 
the  sizs  of  the  reconcentrated  section.  Bata  on  length  of  stay 
in  the  library  indicates  that  it  is  distributed  exponentially^. 

If  the  mean  time  spent  browsing  in  the  section  under  study  is 
f,  the  probability  that  a  person  spends  between  t  and  t  +  dt  there 


22 


during  one  visit  is  (l/T)e 
search  factor  (1  -  e"P^/xN 

1  f(l  -  e-Pt/ld,)e-t/Tdt 


— t/T 

dt  and  the  mean  value  of  the 

)  is 

_  1  =  (P?/N) 

l  +  CpT/xN)  x  +  (pT/N) 


o 

Thus  the  mean  value  of  the  expected  success,  averaged 
over  all  browsers,  for  a  reconcentrated  section  containing  the 

tit  Hie 


fraction  x,  of  the. original  section,  which  have  the  higher 

A 

circulation  rate,  is 


3r  .  Sxfl  ♦  i  Y  =  $  <w> 

According  to  the  earlier  discussion,  pT  is  the  number  of  books 
which  can  be  scanned  with  about  70  percent  efficiency  in  time 
T,  the  mean  time  a  browser  spends.  And  y  »  pT/N  is  the  fraction 
of  the  original,  undivided  section  which  the  average  browser  can 

(ft,  7c  fires.,*') 

scan  adequately Ain  the  average  time  he  allocates  to  this  section. 
Parameter  y  can,  of  course,  be  larger  then  unity,  in  which  case 
the  section  is  small  enough  so  there  is  no  need  to  subdivide  it . 

Figure  4  shows  the  behavior  of  the  function  3r/S,  for 
different  values  of  y.  The  search  factor  y/(x  +  y),  responding 
to  the  fact  that  the  larger  the  section  the  less  meticulous  can 
be  the  search,  is  unity  at  x  *  0  and  decreases  as  x  approaches 
1,  first  slowly  and  then  more  rapidly.  The  factor  x[l  +  ln(l/x)3  , 
measuring  the  expected  number  of  books  of  potential  interest  in 
the  concentrated  fraction  x,  rises  quickly  from  zero  as  x  rises 
from  zero  and  approaches  1  as  x  approaches  1.  The  product, 

Sr/E,  has  a  maximum  somewhere  between  0  and  1,  unless  y,  the 
search  density, is  very  large  (in  which  case  the  optimum  value 
of  x  is  unity  and  there  is  no  need  to  divide  the  section).  But 
if  y  is  less  than  about  2,  the  average  browser,  during  his 


Fi$.4.  Changes  in  expected  success  as  one  changes  the 

fraction  x%of  a  uniform  potential  section*  that  is 
left  in  a  reconcentrated,  browsing  section,  when 
retirement  criterion  is  low  circulation. 
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average  stay,  cannot  scan  the  full  section  effectively,  the 
optimal  value  of  x,  xQ,  is  less  than  1  and  there  is  some 
advantage  in  breaking  the  section  into  a  '‘retired"  section 
and  a  "reconcentrated"  section  containing  x0N  books.  The 
advantage  is  not  very  great  if  y  is  not  much  less  than  1,  but 
if  y  is  less  than  0.1  the  possible  improvement  is  a  factor  of 
2  or  better,  which  is  definitely  advantageous  for  the  browser. 

Expression  (14)  can  be  differentiated  to  find,  for 
different  values  of  Y»  the  optimum  value  of  x,  the  fraction  of 
the  original  section  which  would  yield  the  greatest  success 
for  the  average  browser.  It  is  the  solution  of  the  equation 


InU/xJ 
x  ♦  Y 


1  +  lnCl/x] 

~uTyt 


Y  ln(l/xQ) 


and  for  this  xQ,  (3p/S)  *  xQ(l  +  i) 


(15) 


The  optimal  value  of  Sp  is  then  !ix0,  which  is  to  be  compared 
with  the  value  §  ■  !y/(1  +  Y)  for  the  undivided  section  (x  *  1). 


THs  optimal  browsing  fraction  xQ  is  plotted  in  Fig. 5  and  the 
advantage  3r/3  gained  by  the  division  is  plotted  in  Fig. 6,  both 
as  functions  of  Y« 


A  few  examples  may  show  how  it  can  be  used.  Suppose 
p  is  150  and  T  is  5  minutes  (this  may  seem  short,  but  many 
browsers  scan  several  different  sections  in  a  visit)  or  p  «  100 
and  T  is  7*5  minutes;  in  any  case  pT  is  750  and  y  •  750/N.  Now 
suppose  the  undivided  section  has  N*  1000  volumes.  In  this 
esse  y  -  0,75  and  the  optimal  xQ  is  about  0.5;  we  should  retire 
half  the  collection.  However  we  would  only  gain  about  20  percent 
advantage  ( Sr/3  ■  1.2);  it  is  doubtful  whether  this  is  worth 
the  trouble  of  dividing  the  section.  If  there  are  about  2  books 


i  \ 
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of  immediate  interest  per  thousand  volumes  of  the  initial 
collection*  the  expected  success  would  be  about  1  volume  either 
from  the  full  section  or  the  concentrated  half-section. 

On  the  other  hand,  if  N  were  5000,  y  would  be  0.15, 
for  which  xQ  *  0.25;  the  optimal  browsing  section  would  be  the 
most  popular  1200  books  out  of  the  5000.  The  change  in  success 
expected  would  be  by  a  factor  3r/S  «  1.7,  probably  a  worthwhile 
improvement.  If  there  were  about  1  book  of  interest  per  500 
volumes, for  the  average  browser, in  the  original  section,  3 
for  the  full  section  would  be  about  1.5;  Sr  for  the  quarter- 
sized,  reconcentrated  section  would  be  2.5. 

Finally,  suppose  N  were  30,000,  all  in  a  collection 
homogeneous  enougn  to  make  it  difficult  for  the  average  user 
to  distinguish  one  part  from  another  for  browsing.  The  formulas 
indicate  that  this  outsize  section  should  be  thinned  by  retire¬ 
ment  to  a  browsing  collection  of  the  2000  highest-circulation 
books,  which  would  have  a  search  advantage  over  the  full  coll¬ 
ection  of  a  factor  of  2.7,  raising  the  expected  average  success 
from  3  ■  1.5  for  the  full  collection  to  3r  «  4  for  the 
reconcentrated  section,  a  change  definitely  advantageous  for 
the  average  browser.  As  mentioned  earlier,  a  search  advantage 
of  1.5  or  better  (corresponding  to  a  y  of  0.25  or  smaller) 
would  probably  justify  dividing  a  subject  section,  if  browsing 
is  an  important  factor  and  if  the  division  can  be  made  according 
to  circulation. 


!  I 
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Hetiring  Books  by  Age 

The  procedure  of  retirement  according  to  circulation 
is  probably  optimal.  However  continual  retirement  of  low- 
circulation  books  from  a  browsing  section  demands  a  greater 
awareness  of  book  circulation  than  most  libraries  have  at 
present.  Let  us  see  what  can  be  done  if  the  over-size  section 
is  divided  on  the  basis  of  age.  Measurement^  has  shown  that 
the  distribution  in  circulation  (and  thus  in  potential  interest) 
of  books  which  have  been  on  the  shelf  for  t  years  is,  very 
approximately, 

Probability  that  a  book  of  shelf  age  t  has  a  book 
potential  greater  than  v  is  e“v^'f*:o)/voto 
where  vQ  is  the  book's  potential  during  its  first  year  on  the 
shelf  and  tQ  is  a  parameter  typical  of  the  class  of  book  and 
of  the  average  user  of  the  library.  In  a  science  library  tQ 
may  be  1  for  physics  books  and  2  for  mathematics  books,  for 
example;  books  on  history  may  have  tQ  as  large  as  10  or  20. 

The  mean  book  potential  for  a  book  of  shelf  age  t  would  be 
▼0t0/(t*t0)  if  vQ  weve  its  potential  during  its  first  shelf 
year  and  tQ  were  the  parameter  for  books  of  its  class.  Thus 
the  larger  tQ  is,  the  slower  does  the  class  of  books  decrease 
in  book  potential. 

If  the  uniform-interest  section  under  study  contains 
books  more  or  l'iss  equally  distributed  in  age  from  the  most 
recent  acquisitions  to  the  oldest  with  shelf  age  tB,  then  the 
probability  that  a  book,  taken  at  random  from  the  section,  has 
book  potential  v  or  greater  is 
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P(*v)  -  f  e-v(t*t0)/?0t0  dt 

■  9 

,  Zfliflje"(v/Vo)  -  ^▼/90to><V*‘tm)] 


(16) 


where  7Q  is  the  mean  book  potential  of  all  books  of  the  section 
during  their  first  year  of  shelf  life.  The  probability  density 
corresponding  to  the  f(v)  of  i£q.(ll)  is  f(v)  •  -dP(^  v)/dv. 

The  nean  interest  potential  of  this  collection  of  books  is 

9  -  jvf(v)dv  w  jp(*  v)dv  -  in(i  ♦  (17) 

which  is  less  than  VQ  if  tB  is  larger  than  tQ,  i.e. ,  if  there  is 
an  appreciable  fraction  of  older  books  in  the  section.  Now 
suppose  we  pick  from  these  a  browsing  collection,  with  xN  volumes, 
by  keeping  all  the  books  of  shelf  age  xtB  or  less  and  retiring 
the  rest  to  a  less  accessible  location.  It  is  not  difficult 
to  see  that  the  mean  interest  potential  of  this  collection  is 

”r  •  ^ 10(1  J  (18) 

and  the  expected  number  of  books  of  immediate  interest  to  the 
average  browser  is  ^  ■  rN?r.  Substituting,  we  see  thst 


*»(yt9a 

^  infx  <VVi 


(19) 


wb*r«  i .  y»  is  the  expected  number  of  books  of  immediate 
interest  in  the  full  collection.  This  drops  in  value  from  1 
to  aero  as  x  goes  from  1  to  0,  just  as  does  the  factor 
x[l  +  ln(l/x)]  of  £qs.(12)  and  (14). 

Again  we  introduce  the  search  factor,  as  in  ft*. (14), 
and  obtain  the  expected  success  if  the  everage  browser  spends 
average  time  9  scanning  the  reeoneentrated  section, 


As  with  the  function  of  Eq.(14),  for  the  '* retirement  by  circ¬ 
ulation  procedure”,  this  function  has  a  maximum,  at  x *  1  if  y 
is  somewhat  larger  than  1,  for  x  less  than  1  if  y  is  less 
than  1.  The  equatioa,giving  the  optimal  value  xQ  and  the 
expected . success  for  the  full  collection,  S,  and  for  the 
reconcentrated  section,  3y,  are 

PY  -  (1  ♦  xQ0)ln(l  ♦  xQp)  -  xQp  — ►;(Px0)2  (PY  <0.1) 

(21) 


Values  of  0xQ  are  plotted  against  pY  in  Fig. 7  for  this 
leas  efficient  retirement  plan,  and  values  of  the  expected 
search  advantage  Sr/5  are  shown  in  Fig. 8.  Again  a  few  examples 
are  in  order,  in  order  to  compare  results  with  those  of  £q.(13) 
for  the  more  .efficient  plan.  Here  we  must  distinguish  between 
rapidly  aging  books  (tQ  small,  p  large)  and  slowly  aging  books 
(tQ  large,  0  small).  *e  assume  ,  for  the  example,  that  the 
undivided  section  has  books  of  all  shelf  age  from  0  to  20,  ao 
ta  •  20.  We  take  0*2  (tQ*10)  for  the  slowly  aging  example 
and  0  ■  10  (t0»2)  for  the  rapidly  aging  example  and,  aa  before 
we  assume  that  p9  •  730,  so  that  y  ■  730/N.  For  comparison  ws 

«ft 

assume  that  there *about  2  volumes, of  immediate  interest  to  the 
average  browser,  per  1000  volumes  in  the  undivided  section. 

The  section  with  1000  books  will  certainly  not  be  worth 
dividing^  so  we  start  with  the  example  where  the  undivided 
section  has  3000  books,  so  y  •  0.13,  1  •  10  and  3  •  1.3.  For 
the  case  0  -  2,  0y  -  0.3,  0xQ  •  0.9,  xQ  •  0.43  and  3r/S  •  1.1$ 


Pig*7»  Optimal  fraction  x  as  function  of  coverage 
parameter  0,  when  retirement  criterion  is  shelf 


33 


the  gain  from  §  *  1.3  to  Sr  *  1*4  by  retiring  the  oldest  half 
of  the  collection,  ia  not  worth  the  trouble.  On  the  other  hand 
if  the  collection  was  rapidly  aging, »  10,  then  0y  *  1.5* 
px0  o  2.5,  *  0.25  and  3^/3  «  1.4;  the  gain  from  1.3  to  1.3 

in  S  may  be  worth  the  trouble  of  retiring  all  but  1250  of  the 
youngest  books.  Comparison  with  the  similar  examples  in  the 
earlier  discussion  shows  that  retirement  by  age  is  not  as  good 
for  browsers  as  retirement  by  circulation,  particularly  for  the 
slowly-aging  books  (p  less  than  5). 

b'ven  if  the  uniform  subject  section  consists  initially 
of  30,000  volumes,  separation  by  age  does  not  help  much  for  the 
slowly-aging  classes.  Here  Y  a  0.025,  3  *  60  and  3  *  1.5  as 
before.  For  £  *  2  we  hsve  jiY  *  0.05,  pxQ  *  0.34,  xQ  *  0.1? 
and  3r/3  *  1,4;  increasing  expected  success  from  1.5  to  2  by 
retiring  all  but  5000  of  the  youngest  volumes  may  just  be  worth 
while  (particularly  if  the  collection  must  be  split  because  of 
shortage  of  space).  On  the  other  hand  if  the  collection  is 
rapidly  aging,  with  p  =  10,  then  PY  •  0.25,  PxQ  »  0.8,  xQ  *  0.08 
and  Sr/S  *  2.4;  reducing  the  browsing  collection  to  the  roughly 
2500  books  no  more  than  2  years  old  will  increase  the  expected 

a.  gutter 

success  for  the  average  browser  from  1.5  to  3.6,A  This  is  a 
definite  gain,  though  not  as  great  as  the  factor  2.8  which 
would  be  obtained  if  the  separation  were  on  the  basis  of 
circulation.  Retirement  by  age  retires  some  high-potential 
books  simply  because  they  are  older  and  leaves  in  the  browsing 
collection  too  many  low-potential  books  simply  because  they 
are  younger.  However  the  efficiency  of  separation  by  age  is 
not  too  bad  for  rapidly-aging  book  classes.  If  we  say  that  a 

. 
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search  advantage  Sr/5  of  1.5  or  more  makes  it  worth  while  to 
separate  off  a  browsing  section  then,  for  classes  with  £  .  10, 

Y  should  be  less  than  0.15*  for  p  *  2,y  should  be  less  than 
0.01  before  browsing  advantage  would  make  separation  worth  while. 

These  criteria  will  become  easier  to  determine  the  more 
accurately  the  parameters  p,  p,  $  are  determined  for  the  library 
and  its  users,  and  the  more  precisely  one  can  delimit  the  various 
uniform  subject  sections  of  the  library.  Also  the  separation 
of  the  over-large  sections  into  high-potential,  browsing  sections 
and  more-secluded  sections  for  the  less-used  volumes,  will 
become  easier  when  means  are  devised,  by  computer  or  otherwise, 
to  keep  continuous  track  of  the  circulation  rates  of  the  books 
in  the  library. 


i 


i  « 
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References  and  Notes 


xFor  the  application  of  search  theory  to  antisubmarine  search, 
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MIT  Press  and  John  Wiley  and  Eons,  New  York,  1951*  and 
B.O.Koopman,  Operations  Research,  4,296,19%  and  4,503,19%. 
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This  formula,  in  the  context  of  antisubmarine  search,  is 
derived  in  B.O.Koopman,  Operations  Research, 613, 195?»  which 
summarizes  work  carried  out  during  World  War  II. 

z 

^See,  for  example,  H.Chernoff  and  L. Moses,  Elementary  Decision 
Theory .  John  Wiley  and  Sons,  New  York,  1959. 

For  further  discussion  of  this  point,  see  the  paper  "Search 
and  Browsing"  by  P.M. Morse  in  the  forthcoming  "Festschrift” 
for  Jesse  Shera,  to  be  published  by  th'  Case  Western 
Reserve  Press. 

^See,  for  example,  P.M. Morse,  Library  Effectiveness  ,  MIT  Press, 
Cambridge,  Mass.,  1968  and  other  references  given  there. 

£ 

Of  course  the  individual  books  which  would  be  of  immediate 
interest  to  one  browser  would  differ  from  those  of  immediate 
interest  to  another.  If  neither  browser  knows  where  in  the 
section  his  "wanted"  volumes  are,  we  are  justified  in  taking  the 
average  of  the  number  of  such  books  in  the  section,  as 
estimated  by  each  browser  (even  though  each  estimate  refers 
to  different  books), as  the  value  of  E  for  the  section. 
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